Mirex 2012: Chord Recognition Using Duration-explicit Hidden Markov Models
نویسندگان
چکیده
We present an audio chord recognition system based on a generalization of the Hidden Markov Model (HMM) in which the duration of chords is explicitly considered a type of HMM referred to as a hidden semi-Markov model, or duration-explicit HMM (DHMM). We find that such a system recognizes chords at a level consistent with the state-of-the-art systems – 84.23% on Uspop dataset at the major/minor level. The duration distribution is estimated from chord duration histograms on the training data. It is found that the state-of-the-art recognition result can be improved upon by using several duration distributions, which are found automatically by clustering song-level duration histograms. The paper further describes experiments which shed light on the extent to which context information, in the sense of transition matrices, is useful for the audio chord recognition task. We present evidence that the context provides surprisingly little improvement in performance, compared to isolated frame-wise recognition with simple smoothing. We discuss possible reasons for this, such as the inherent entropy of chord sequences in our training database.
منابع مشابه
Chord Recognition Using Duration-explicit Hidden Markov Models
We present an audio chord recognition system based on a generalization of the Hidden Markov Model (HMM) in which the duration of chords is explicitly considered a type of HMM referred to as a hidden semi-Markov model, or duration-explicit HMM (DHMM). We find that such a system recognizes chords at a level consistent with the state-of-the-art systems – 84.23% on Uspop dataset at the major/minor ...
متن کاملMirex 2010: Joint Recognition of Key and Chord from Music Audio Signals Using Key-modulation Hmm
This extended abstract describes a submission to the Music Information Retrieval Evaluation eXchange 2010 (MIREX 2010) in the Audio Chord Estimation and Audio Key Detection tasks. We propose a new model to recognize musical keys and chords simultaneously from musical acoustic signals including key modulations. Chords and keys are closely related notions of music involving harmony. Since occurre...
متن کاملImproved Automatic Chord Recognition
This paper describes a chord recognition system submitted to the MIREX 2009 Audio Chord detection contest. Extracting harmonic information from audio signals has become a topic of keen interest for many researches in Music Information Retrieval (MIR) community. The FBK submission consists of two chord detection system: baseline and the system with language modeling functionality. The two submis...
متن کاملThe 2010 Labrosa Chord Recognition System
For the MIREX 2010 Audio Chord Extraction task, we submitted a total of four systems. Our base system is a trainable chord recognizer based on two-band chroma representations and using a Structured SVM classifier to replace the more familiar hidden Markov model. We submit two versions of this system, one which transposes all training data through all 12 possible chords to maximize the training ...
متن کاملImproving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کامل